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ABSTRACT 

We report on the discovery and validation of Kepler-452b, a transiting planet identi¬ 
fied by a search through the 4 years of data collected by NASA’s Kepler Mission. This 
possibly rocky 1.63 ^q 20 -R® planet orbits its G2 host star every 384.843^QQ'j’2 days, the 
longest orbital period for a small {Rp < 2 R®) transiting exoplanet to date. The like¬ 
lihood that this planet has a rocky composition lies between 49% and 62%. The star 
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has an effective temperature of 5757 ± 85 K and a log g of 4.32 ± 0.09. At a mean 
orbital separation of 1.0461 q'qJ 5 AU, this small planet is well within the optimistic hab¬ 
itable zone of its star (recent Venus/early Mars), experiencing only 10% more flux than 
Earth receives from the Sun today, and slightly outside the conservative habitable zone 
(runaway greenhouse/maximum greenhouse). The star is slightly larger and older than 
the Sun, with a present radius of l.ll^QQg Rq and an estimated age of ~6Gyr. Thus, 
Kepler-452b has likely always been in the habitable zone and should remain there for 
another Gyr. 

Subject headings: planets and satellites: detection — stars: fundamental parameters — 
stars: individual (Kepler-452, KIC 8311864, KOI-7016.01) 


1. Introduction 



The field has advanced at a rapid pace ever since, enabling discoveries at longer orbital periods and 
smaller sizes. 


Today ~1500 exoplanets have been found through a variety of means, including Doppler sur¬ 
veys, ground- and space-based transit surveys, g ravitational microlen sing surveys (starting in 1996 
with OGLE-2003-BLG-235/MOA -2003-BLG-53, Bennett et al. 2006 1. and recently, direct imaging 
(beginning in 2011 with HR 8799. ISoummer et al.ll201lh . As impressive as the progress has been in 
this nascent field of astronomy, perhaps the most exalted goal, and one that remains tantalizingly 
close, is the discovery of a sufficient number of (near) Earth-Sun analogs to inform estimates of 
the intrinsic frequency of Earth-size planets in the habitable zone of Sun-like stars. 

The habitable zone is an evolving concept, here taken to be that range of distances about a 
star permitting liquid water to pool on the surface of a smal l rocky planet. For our purposes, we 
adopt the optimistic habitable zone as per iKopparapu et al.l (120131 1 with the insolation flux, Seg, 
between that experienced by recent Venus as the inner edge, and that experienced by early Mars 
as the outer edge. While the exact values depend somewhat on the effective temperature of the 


^Curiously, the first exoplanet transit survey focused on the smallest known eclipsing binary, CM draconis, to 
search for circumbinary planets, an endeavor that would not succeed until the Kepler Mission. 
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star, the optimistic habitable zone lies within the range from 
experienced by earth today, that is from 0.2 50 to 1.8 50 . 


^ 20 % to ~180% of the radiation 


To date, nine small planets with radii less than 2.0 ii 0 have been found in the 4 years of 
Keyler daXa. in or near the habitable zon es of their stars . The se discoveries include Kepler-62e 


and f (Borucki et a 


planets report ed by 


2013lb Kepler-1 86f ( Quintana et al. 2014 ). a few among the multitude of 


Rowe et al.l (2014l i: Kepler-283c, Kepler-296e and f; and a set of small planets 


announced in [Torres et al.l (120151 ): Kepler-438b, Kepler-440b, and Kepler-442b. The repurposed 
Kepler Mission, dubbed K2, h as also added a small p lanet on the inside edge of its habitable zone: 
K2-3d (aka EPIC 201367065d; Crossfield et al. 2015 ). Most of these small planets are beyond the 
reach of Doppler surveys and cannot be confirmed independently by radial velocity detections, and 
therefore must be validated statistically. Follow-up observations are conducted to rule out, to a 
very high statistical significance, scenarios in which astrophysical signals can mimic the planetary 
transits under investigation. 

Doppler surveys have been successful for a handful of very bright (nearby), cool stars (where 
the habitable zone is close-in and re flex velocity amplit ude is high). These possibly rocky planets 
(msini < 10Mm) i nclude GJ 16 3c (Bonfils et al. 2013 ). GJ 667 Cc (Anglada-Escude et al.l 2013 : 


Delfosse et al 


GJ 832c ( Wittenmver et al.ll2ni4l ). 


20131). HD 40307g (Tuomi et al.ll2ni3l ). Kapteyn b ( Anglada-Escude et al.ll2ni4l ). and 


All of these are orbiting s mall (R-t < R^), c ool < 5000 K) stars. According to the 
recent Kepler Q1-Q16 catalog ( Mullallv et al. 2015 ) available at the NExSGI exoplanet archive^ 
approximately 65 additional small planetary candidates have been identified orbiting in or near 
the habitable zones of stars over a much wider range of spectral types with effective temperatures 
from 2703 K to as high as 6640 K. All of these candidates need to be vetted thoroughly, given their 
importance to the prospect of reaching the primary Kepler Mission objective: to determine the 
frequency and distribution of terrestrial planets in the habitable zones of Sun-like stars. 


Here we present the detection and statistical validation of Kepler-452b, a possibly rocky 1.6- 
planet orbiting in the habitable zone of its G2 host star KIG 8311864 every 384.84 days, the 
longest orbital period for a small {Rp < 2 R^) transiting exoplanet discovered to date. In this 
paper, we apply the BLENDER analysis technique (see Section [9|) to validate the planetary nature of 
Kepler-452b , but note that siniilar approaches have been developed by ot hers and are in use in 
transit surveys (see, e.g. lMortonll2ni2l : iBarclav et al.ll2ni3l : iDfaz et al.ll2ni4l ). 


The paper is organized as follows: Section [2] describes the initial discovery of the transit 
signature of this super Earth in 2014 May. Section [3] presents the photometry for this transiting 
planet, describing the discovery of the transit signature in the data, and section [4] describes the 
results of several tests performed in the Kepler science pipeline to help validate the planetary 
nature of the photometric signature. Section [5] describes the spectroscopic measurements obtained 


^http: //exoplanetarchive . ipac . caltech. edu/cgi-bin/ExoTables/nph-exotbls?dataset=cuniulative_oiily 
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for this host star, and section [6] presents the stellar properties obtained for KIC 8311864. The 
analytical approach used to fit the light curve and determine the planetary properties is described 
in Section [71 Section [8] details the adaptive optics observations made to search for close companions. 
Section [9] describes the results of a full BLENDER validation analysis, which indicate that the planet 
hypothesis is favored at an odds ratio of 424:1, providing high confidence that this is, indeed, a 
planetary signature. Section [10] discusses the history of Kepler-452b in the context of the evolving 
habitable zone of its host star. Section m summarizes the results. The appendix presents a 
mathematical development of the bootstrap diagnostic calculation. 


2. Discovery 


Kepler-452b was discovered in a test run of the Kepler Science Operations Center (SOC) 9.2 
codebase in 2014 May when one of us (J. Twicken) inspected the planet search pipeline results 
to assess performance of an enhanced pipeline codebase for small, cool planets. According to the 
Data Validation pipeline module, the transit signature of this object featured four 10.5hr, 200-ppm 
deep transits spaced 384.846 days apart, a radius of 1.1 TJq, and an equilibrium temperature of 
221 K assuming an albedo of 0.3 and full redistribution of heat. None of the Data Validation 
diagnostics definitively rejected the planetary nature of this signature, and while both the radius 
and the equilibrium temperature proved to be significantly larger once better stellar parameters 
were obtained, this was clearly a highly interesting threshold crossing event (TCE). 


This transit signature was not identified in the previous search through all 4 ye ars of Kepler 
data with the SOC 9.1 codebase conducted in 2013 August (jTenenbaum et al.ll2014l ). most likely 
due to overly aggressive consist ency checks put in to place to reduce the large number of false 
alarms by instrumental effects (ISeader et al.l [2013) • After all 4 years of Kepler data had been 
re-processed with the deployed SOC 9.2 codebase, Kepler-452b’s tran sit signature was re -identified 
in the recent run of the planet search pipeline in 2014 November (jSeader et al.ll2015l ). The re¬ 
processed light curves and calibrated pixels have been archived at the Mikulski Archive for Space 
Telescopes (MAST) as data release 24, and the transit search results have been archived at NASA’s 
Exoplanet Science Institute’s exoplanet archive|§ This planet has been designated as KOI-7016.01 
and appears in the online KOI catalog table on NExScI with the first delivery of the Q1 ~ Q17 
KOIs. We expect that several more terrestrial planetary candidates will be identihed when all the 
new SOC 9.2 TCEs are fully vetted. 


This is the first planetary signature detected for KIC 8311864, which is noteworthy, given 
the long orbital period and the h igh fraction of multiple transiting planet systems identified by 
Kepler (see e.g., iRowe et al.[ [20141 1. If there are planets in orbits interior to that of Kepler-452b, 
either they are inclined so as not to present transits or they are too small to detect against the 


®Available at http: //exoplanetarchive . ipac. caltech. edu/cgi-bin/TblView/nph-tblView?app=ExoTbls&conf ig=ql_ql7_dr24_1 
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photometric noise for this 13.4 mag star (which averaged ~40ppm at 10.5 hr timescales across the 
4 year dataset). 

The stellar parameters for this star at the time of the test run included an effective temperature 
of 5578 K but a radius of only 0.79 720, yielding a planet with a correspondingly small radius of 
1.1 720 at a mean separation of 0.99 AU. The small stellar radius was due to an anomalously high 
log g of 4.580 Given that the star was likely to be significantly larger than 0.79 Rq on account of its 
temperature, we sought reconnaissance spectra in order to better pin down the stellar parameters 
and eventually obtained a spectrum with the Keck HIRES spectrometer. These measurements 
indicated that the surface gravity of the star was overstated by the KIC, and thus imply that the 
planet is ~45% larger than it appeared in the Data Validation summary report produced in the 
2014 May test run of the Kepler pipeline. 


3. Kepler Photometry 


The Kepler telescope obtained nearly continuous photome try in a 115 deg^ field of view (FOV) 
near Cygnus and Lyra for nearly 4 years (jBorucki et al.l 120101 ). During this period (2009 May 13 
through 2013 May 11), Kepler observed 111,800 stars nearly continuously and altogether observed 
a total of more than 200,000 stars over its operational mission. The Kepler observations were 
broken nominally into 3 month (93 day) segments, dubbed quarters, due to the need to rotate the 
spacecraft by 90° to keep its solar arrays pointed toward the Sun and its sunshade properly oriented 
( Koch et al. 201C). 

The star Kepler-452 (Kepler Input Catalog (KIC) = 8311864, a = 19^44™00!883, 6 = 44°16'39'.'22, 
J2000, KIC r = 13.367mag) was observed during each of the 17 quarters (Ql, Q2, ..., Q17) re¬ 
turned by Kepler^ The first and last quarters were very much abbreviated comp ared to the other s: 

Ql lasted 34 days and took place immediately after commissioning was complete (jHaas et al.ll201Cll L 
while Q17 lasted only 32.4 days, being truncated by the failure of a second reaction wheel, thus 
ending the data collection phase of the Kepler Mission. 

Figure [U shows all 17 quarters of the systematic error-corrected flux time series data for this 
star, with the locations of the transits identified by triangle markers0 The hrst transit takes 
place during Q3 at a barycentric-corrected Kepler Julian date (BKJD)0of 314.985lQQ(g, and three 


"‘The original KIC value for the log g of 4.99 I Brown et al. 201 ill was corrected by moving the stellar properties 
to the nearest Yonsei-Yale isochrone, giving a still somewhat high value of 4.58. 


set of ~53,000 stars including KIC 8311864 were observed for a 10 day period at the end of the commissioning 
period for Kepler that is called QO. This data set is not used in the search for planets or for constructing pipeline 
diagnostics in the Data Validation module, and is not used in this analysis except where otherwise noted. 

®A11 Kepler photometric data products can be found at the MAST archive at http://archive.stsci.edu/kepler. 

^Barycentric-corrected Kepler Julian date (BKJD) is defined as BJD—2,454,833.0. 
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additional transits occur every 384.843 ^q'q° 2 days, one each in Q7, Qll and Q15. The transits each 


last approximately 10.5 hours and are 200 ppm deep. The scatter in the photometric data in Figured] 
is quite uniform in all quarters and there is no obvious evidence of the electronic image artifacts 
that plague a subset of the Kepler target stars as they rotate through the handful of affected CCD 
readout channels, resu l ting i n a large number of false alarm TCEs (|Van Cleve &: Caldwellll2009l : 


Tenenbaum et al 


20131 . 12 OI 4 I ). Indeed, this star does not fall on any of the CCD channels associated 


with these image artifacts in any observing season. 

The planet was detected at the 9.7 cr level, well above the 7.1a threshold and passed the 
other co nsistency checks exercised by the Transiting Planet Search (TPS) component of the SOC 
pipeline (Jenkins 2002; Jenkins et al. 2010 : Seader et al. 2013 : Tenenbaum et al. 2013 . 2014 '). The 
light curve with its putative transit signature was fitted with a iMandel and Agoll (j2002l ) limb- 
darkene d transit waveform and subjected to a suite of diagnostic tests by the Data Validation 
module (|Tenenbaum et al.ll2010l : IWu et al.ll2010lb Figure [2| shows the phase-folded, detrended light 
curve, zoomed on the transit epoch. The individual transits are significant at the ^4.75 a level, so 
that each can be credibly seen by eye in the long cadence (LC) data at a 29.4 minute resolution. 
The DV diagnostics appeared to be consistent with the planetary hypothesis, so we coordinated 
follow-up observations to help refine the stellar (and hence planetary) properties and to further 
bolster the case for the planetary nature of this object. 


The following section describes the results of the validation tests performed in DV. 


4. Data Validation Tests 

This section describes the most salient validation tests performed inside of the Data Valida¬ 
tion component of the SOC pipeline. The DV report resulting from the official SOC 9.2 processing is 
available at http://exoplanetarchive.ipac.caltech.edu/data/KeplerData/008/008311/008311864/dv/kpl] 


4.1. Discriminating Against Backgronnd Eclipsing Binaries 

Data validation applies several tests for the presence of background eclipsing binaries. The first 
is the odd/even test: DV fits a limb-darkened transit model to the odd transits and an independent 
model to the even transits, and then compares the depth fitted to the odd transits to that fitted 
to the even transits. This results in a test statistic of 0.14 u with a corresponding significance of 
89.O%0 In addition, DV examines the ratio of the temporal offsets between the odd and even 


®The reprocessed light curves are available at http://archive.stsci.edu/kepler/ and the release notes can be 
found at http: //archive.stsci.edu/kepler/release_notes/release_notes24/KSCI-19064-001DRN24.pdf 

®The significance of these statistical tests is relative to the null hypothesis. That is, a significance near 1 indicates 
a null result (not an eclipsing binary) while a significance near zero indicates that the alternative hypothesis is highly 
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transits compared to the fitted orbital period, the so-called “epoch” test, giving a result of 0.10 a 
and a corresponding significance of 92%. Figure [3] illustrates the odd/even test by plotting the light 
curve data phase folded separately for the odd and even transits. No other TCEs were generated 
for this star and no evidence was seen for weak secondary eclipses. Based on these tests, we see 
no evidence for systematic differences in the odd/even transit depths and durations which often 
result in the case of background eclipsing binaries, and the transit times are consistent with a strict 
periodic spacing. 


4.2. Statistical Bootstrap Test 


In order to better determine the statistical significance of the transit sequence, we examined the 


boot strap distribution for the population of null statistics for the out-of-transit data (jjenkins et al 


2002I . J. Jenkins & S. Seader, in preparation). This test relaxes the assumption that the observation 


noise is broadband colored Gaussian noise to establish a robust significance level for the detection 
statistic of the planet candidate. The appendix contains a mathematical derivation of the statistical 
approach and a brief description of its implementation. 


The false alarm rate is estimated using a bootstrap statistical analysis of the out-of-transit 
data for this star. The single event statistics time series, representing the likelihood of a transit 
of the fitted duration as a function of time, are drawn at random (with replacement) to construct 
artihcial multiple event statistics corresponding to the same number of transits that occurred for 
the TCE in question. The distribution of all such bootstrap samples provides a means to assess the 
statistical significance of the maximum multiple event statistic, the maxMES, that resulted in the 
identification of the TCE in the first place. This methodology relaxes the assumption of perfection 
on the part of the adaptive, power spectral density estimator in TPS that informs the pre-whitener, 
allowing for the detection signal-to-noise ratio (S/N) to be evaluated against the distribution of 
null statistics drawn directly from the observation noise of the light curve. The bootstrap false 
alarm rate for the transit signature of Kepler-452b is securely established as statistically significant 
by this test for the 9.7 a detection of this planet, at a level of 2.32 x 10“^®. This figure is well below 
that required by the Kepler experiment, ~ 1 x 10“^^, to limit the number of false positives to no 
more than one over the full mission. Eigure [4] illustrates the bootstrap test for this planet. 


4.3. Centroid Analysis 


Data validation performs two types of pixel-level centroid analysis (|Brvson et al.l 120131 ) to 
determine the location of the transit source relative to KIC 8311864. This analysis constrains the 
likelihood that transits observed in a light curve may be due to background eclipsing binaries by 


likely. 








providing a “confusion radius,” inside which a background transit source cannot be distinguished 
from the target star. Sources outside this confusion radius are ruled out as possible causes of the 
transit signal. This radius is passed to the BLENDER analysis of Section [9l 

The first analysis technique uses the change in the photocenter (centroid shift) of the flux on 
pixels when the transit occurs to infer the location of the source of the transit signal. The row 
and column of the photocenter of pixels collected for this target are measured for every LC in 
which the target was observed. These row and column time series are then fitted to the transit 
model (normalized to unit depth) derived from the light curve. The fitted amplitudes of the row 
and column fits provide the apparent centroid shift on the sky. The use of every observed LC in 
this fit allows a very high precision measurement of this centroid shift. Dividing this centroid shift 
by the transit depth gives an estimate for the transit source location relative to the target star. 
For KIC 8311864, the source offset was estimated as T.'832 at 1.63 u, consistent with the transit 
source being co-located with KIC 8311864. This estimate, however, tends to be biased so as to 
overestimate the source offset relative to the mean photocenter of the pixels. It also tends to be 
more vulnerable to noise, instrumental systematics, and other flux variations in the photometric 
mask than estimates obtained via the second technique. 

The second method consists of performing difference image analysis on in-transit pixel data 
compared with neighboring out-of-transit pixel data in each quarter. The average of the in-transit 
images is subtracted from the averaged out-of-transit images. When flux variation in the photomet¬ 
ric mask other than the transit is negligible, the transit source appears as a star in the difference 
image. Fitting the quarterly difference and out-of-transit images with the Kepler point-spread func¬ 
tion (PSF) then provides a direct measurement of the offset of the transit source from KIC 8311864 
per quarter. The quarterly offsets are combined via robust averaging to provide an estimate of 
transit source offset. The difference image analysis generally tends to be less vulnerable to noise 
and other flux variation than the brightness weighted centroid results. For shallow transits which 
occur only once per quarter, such as Kepler-452b, however, instrumental systematics can result in 
very poor (un-star-like) difference images whose PSF fit will result in significantly biased position 
estimates. When the systematics are sufficiently strong, the fit can fail to converge. 

For KIC 8311864, all four quarters containing transits had successful fits, shown as the green 
crosses in Figure [5j Two of the quarters, however, had very poor, non-star-like difference images 
and resulted in the two locations to the northwest of KIC 8311864. The other two quarters had 
good, star-like images, resulting in offsets that straddle KIC 8311864, slightly offset to the west. 
Because the two non-star-like difference images are not validly ht by the Kepler PSF (the difference 
images in these quarters do not contain anything that looks like a star), we exclude them from our 
analysis. The remaining two quarters give a transit source offset from KIC 8311864 of 01^782 ±0.33, 
or 2.35(T. This measurement formally has a 3a confusion radius of KO. 

To address the concern that a measurement using only two data points can be misleading, we 
estimate that a Gaussian uncertainty of C/.'50 would result in the observed uncertainty of 0l'33 half 
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the time. We therefore conservatively set the 3-cr confusion radius to V.b. Potential transit sources 
outside this 1"5 radius are ruled out. Possible unknown sources inside l".b are addressed in the 
BLENDER analysis of Section [9l 


4.4. Contamination by Other Sources 

In addition to the tests conducted by the pipeline module Data Validation on the Kepler 
data themselves, we also performed manual checks against contamination by other astrophysical 
sources in the Kepler FOV. The two principal sources of contamination are (1) optical ghosting 
from eclipsing binaries well offset from the target star due to reflections off of optical elements in the 
Kepler telescope, and (2) video crosstalk between CCD channels that are read out simultaneously. 


Optical ghosting, or direct PSF contamination, was the subject of a study by ICouehlin et al 


(|2014l l identifying 685 TCEs in the Q1-Q12 vetting process as false positives due to these and 
other effects. To determine whether optical ghosting was an issue, we looked for period-epoch 
collisions between Kepler-452b a nd the TCEs identihed in both the 2012 August search through 
Q1- Q16 (ITenenbaum et al.l 120131 ) and those from the recent 2014 September search through Ql- 


Q17 (ISeader et al 


201 


^). None were found. Nor were any TCEs found either at one half or one 


third of the orbital period of Kepler-452b with transit epochs matching the epochs of Kepler-452b. 


In addition, we queried the general catalogue of variable stars (GCVS) (ISamus et al.ll2012l ) 
for variable stars with periods within 10 hours of Kepler-452b. Only two stars in the GCVS had 
periods close to 384.84 days that were within ~6° of KIC 8311864. Both of these are pulsating 
variables, however, not eclipsing binaries, and therefore cannot explain the transits of Kepler-452b. 
One of them is EX CYG aka KIC 4671784, which was observed in LC mode throughout KepleKs 
mission. The other variable star is V1299 Cyg which is more than 1° from the Kepler EOV and 
thus cannot be the source of any contamination. No variable stars were found in the GCVS in or 
near Kepler's EOV with periods near one half or one third that of Kepler-452b. The likelihood 
that optical ghosting could explain the transits of KIC 8311864 is further diminished by the fact 
that most optical ghosts tend to create multiple “children” signals in the Kepler light curves as the 
ghost images are defocussed or severely scattered. 

Video crosstalk occurs when analog voltage signals in one video circuit impress th emselves 
onto other video circuits in the readout electronics boards (jVan Cleve fc Caldwelll l2009l . p. 71), 
contaminating the other CCD readout values. This represents an additive noise source and the 
contamination is an attenuated version of the original signal, but can either be positive or negative, 
with a typical value of ~4400 for the attenuation factor (or a gain factor of 2.3 x 10“^). While video 
crosstalk does occur between channels read out at the same time on different CCD modules, the 
attenuation is so high that the only appreciable video crosstalk occurs within each pair of CCDs 
mounted together on the same module. To rule out contamination by video crosstalk, we examined 
the quarter 3 month 2 full frame image (EEI) kplr2009322233047_ff i-cal. f its obtained on UT 
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2009 November 18 for evidence of bright stars read out on the same pixels as KIC 8311864 on the 
adjacent CCD channels 


For a target star of brightness B experiencing video crosstalk from a perfectly situated eclipsing 
binary (or transiting giant planet) of brightness h and an eclipse (or transit) depth of 6b/h on an 
adjacent CCD readout channel, the apparent transit depth exhibited by the target star is given by 


Tdep X 10-6 


a6h a 5b/h 
B + ab B/b + a 
a 6b/b _ 6b b 
B/b ~ ^Tb' 


( 1 ) 


where the second line arises for the case of interest here as a <C 1. 


All four transits of KIC 8311864 occur on CCD module/output 7.3, as the orbital period is close 
to one year and the first transit occurs sufficiently early in Q3 so that the subsequent three transits 
take place when the Kepler photometer is in the same seasonal orientation. Thus, only stars on the 
adjacent channels 7.1, 7.2, and 7.4 could produce video crosstalk contamination for Kepler-452b. 
Of these, only channel 7.1 has a readily visible star [a = 19^44™27®.95, 6 = 42°46'13".55) near the 
same row/column coordinates as for KIC 8311864 on channel 7.3. Its peak pixel actually is just 1 
row below the optimal aperture for Q3, so only flux from the wing of the PSF could be getting into 
the aperture of KIC 8311864. No Kepler light curves exist for this star since it was not selected as 
a target during the 4 year Kepler Mission. 

This star on channel 7.1 is about 3 mag fainter than KIC 8311864 and the video crosstalk 
coefficient for channel 7.1 on channel 7.3 is 2.3820x10-6. Hence, a full eclipse (50% drop in 
brightness) on this star on channel 7.1 would produce an apparent transit depth of ~0.75ppm. 
A 1% giant planet transit on this star would impute an even smaller apparent transit depth of 
~0.015 ppm. We can thus eliminate this star as a potential source of confusion as it cannot credibly 
cause a transit-like feature deep enough to explain the 199 ppm transits exhibited by Kepler-452b. 

Channels 7.2 and 7.4 are also incapable of producing a false positive here through video 
crosstalk. Channel 7.2 has a negative crosstalk coefficient with 7.3 (-1.9759x10-^) and 7.4 has 
a crosstalk coefficient of 1.276x10-^, but no apparent star in the pixels corresponding to the aper- 
tnre of KIC 8311864. 


We conclude that contamination through either optical ghosting or video crosstalk is highly 
unlikely for Kepler-452b. 


lOpFIs are available at http://archive.stsci.edu/kepler/ffi/search.php 






-11 - 


5. Spectroscopy 

Due to the curiously small stellar radius of KIC 8311864 for the effective temperature available 
at the time of the 2014 May planet search, it was clear that reconnaissance spectroscopy was called 
for in order to obtain a reasonable interpretation of the nature of the source of the transit-like 
features in the Kepler light curve. We obtained one spectrum from the McDonald Observatory 
in 2014 May and two additional spectra from the Pred L. Whipple Observatory in 2014 June. To 
improve the stellar parameters derived from spectroscopy, we obtained a spectrum from the Keck 
I observatory with the HIRES instrument in 2014 July. The HIRES data were analyzed with 
three independent analysis packages in order to better understand the effects of systematics in the 
software codes and to determine the most reasonable stellar parameters to use for interpreting the 
photometry in this system. 


5.1. Reconnaisance Spectroscopy wfith the Tull spectrograph at the McDonald 

Observatory 2.7 m Telescope 


We obtained a reconnaissance spectrum for KIC 8311864 using the Tull Coude spectrograph 
(|Tu 11 et al.l 1 19951 1 at the Harlan J. Smith 2.7 m telescope at McDonald Observatory. We used the 
standard instrumental setup that covers the entire optical spectrum and uses a l'!2 slit that yields a 
resolving power of R = 60,000 with 2-pixel sampling. We observed the star in the night of UT 2014, 
18 May. The exposure time was 1720 seconds and the resulting spectrum has a S/N of around 31:1 
(per resolution element) at 5650A. We retrieved stellar parameters using our new spectral fitting 
tool Kea. Kea was specifically developed to determine reliable stellar parameters from the lower 
S/N reconnaissance spectra of our Kepler follow-up program at McDonald Observatory. 


We used the MOOG (jSnedenlll973l ) stellar spectral code in its spectral synthesis mode with the 


Kuruca (Il993l ) stellar atmospheric models to construct a grid of synthetic stellar spectra. We used 
atomic line parameters obtai ned from the VALD (Vien na Atomic Line Data base). We in cluded 
molecular opacities for MgH ( Hinkle et al. 2013 1. TiO (Plez 1998 1 and CN ( Sneden 2014 1. The 
spectral grid covers a range of effective temperature from 3,500 to 10,000 K in 100 K steps, a 
stellar metallicity of [Fe/H] from -1.0 to -1-0.5 dex in 0.25 dex steps, and a surface gravity log(g) of 
1.0 to 5.0 dex in steps of 0.25 dex. We used the MOOG “weedout” feature to eliminate atomic and 
molecular lines in the line list with a ratio of line to continuum opacity of less than 0.001. 


Kea compares 12 orders of the continuum-normalized Tull spectrum with the synthetic grid 
using statistics. For this comparison we convolve the synthetic spectra with a Gaussian-shaped 
PSF to assure the same spectral resolution of model and data. We also rotationally broaden the 
model spectra to find a best match for the (projected) v^ot of the star. The final stellar parameters 
are obtained from the mean and standard deviation of the best-fit models of the spectral orders 
that Kea is using. A more detailed description of this method will be presented in a forthcoming 
publication (M. Endl &: W. Cochran, in preparation). From the KIC 8311864 reconnaissance 
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spectrum, we determined an effective temperature of Tgff = 5650±108 K, a metallicity of [Fe/H] = 
0.21 zb 0.07 dex, a surface gravity of logg = 4.45 zb 0.16, and a rotational broadening of v^ot = 
4.30 zb 0.5 km s"^ for KIC 8311864. 


5.2. Reconnaisance Spectra Obtained from TRES 


We obtained two spectra using the Tillinghast Reflector Echelle Spectrograph (TRES; |Fures 2 


2008|), on the 1.5 m telescope at the Fred Lawrence Whipple Observatory (FLWO) on Mt. Hopkins, 
Arizona, which has a resolving power oi R = 44, 000 and a wavelength coverage of r\j 3900-9100 A. 
Th e spectra were taken on UT 2014, June 09 and UT 2014, June 14 and were extracted as described 
by Buchhave et al. (2010). 


We used the Spectral Parameter Classification (SPC) (jBuchhave et al.l l2012l i technique to 
determine stellar parameters. SPC cross correlates an observed spectrum against a grid of synthetic 
spectra using the correlation peak heights to det ermine the be st fit parameters. The synthetic 
spectra are based on Kurucz model atmospheres (jKurucall993l ). We allowed Teg, log g, [Fe/H], 
and i^sinz to float as free parameters. The S/N per resolution element on the TRES spectra were 
slightly lower than usual when running SPC so these results should be viewed with caution. A 
weighted average of the results from the two spectra gave Tefj = 5751 zb 55 K, log g = 4.43 zb 0.10, 
[Fe/H] = 0.40 zb 0.08, and usini = 4.2 zb 0.5kms“^. The maximum height of the normalized cross 
correlation function (CCF) was 0.916 and the combined S/N of the spectra was 23.5. 


5.3. HIRES Spectroscopy 


On UT 2014, July 3, we acquired a high resolution spectrum of KIC 8311864 with the Keck I 
Telescope and HIRE S spectrometer. We used the standard setup of the California Planet Search 
([Howard et al.ll2010l ) without the iodine cell, resulting in a spectral resolution of 60,000, and S/N, 
at 5500 A, of 40 per pixel. 

SpecMatch compares a target stellar spectrum to a library of 800 spectra from stars that 
span the HR diagram (Teff = 3500-7500 K;log g = 2.0-5.0). Parameters for the library stars 
are determined from LTE spectral modeling. Once the target spectrum and library spectrum 
are placed on the same wavelength scale, SpecMatch computes the sum of the squares of the 
pixel-by-pixel differences in normalized intensity. The weighted mean of the ten spectra with the 
lowest values is taken as the final value for the effective temperature, stellar surface gravity, and 
metallicity. SpecMatch-derived stellar radii are uncertain to 10% RMS, bas ed on tests of star s 
having known radii from high resolution spectroscopy and asteroseismology ([Huber et al.l l2013l i . 
The stellar parameters determined using SpecMatch were Teff = 5757zb60 K, log g = 4.32zb0.07 dex 
and [Fe/H] = 0.21 zb 0.04. 
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Two other analyses were carried out on the HIRES spectrum to better understand the role of 
instrumental systematic errors between the three sets of observations, and to better determine how 
to set the error bars on the resulting stellar parameters. 

An SPC analysis of the HIRES spectrum gave the following stellar parameters: = 5740 ± 

50K, log g = 4.42 ± O.lOdex, [Fe/H] = 0.22 ± 0.08, z^sini = 0.8 ± 0.5kms“^, maximum height of 
the CCF = 0.988, and S/N = 58.8 per resolution element. 

We also performed a standard EW analysis of Eel and Fell lines using MOOG and Kurucz’s 
odfnew models. We determined two sets of Teg, log g, [Fe/H], and vt (microturbulence) values, 
one using only the Keck spectrum and a second using a solar spectrum from MIKE/Magellan 
for line-by-line differential analysis. The solution based solely on the Keck spectrum gave Tgff = 
5740 ± 35 K, log g = 4.230 ± 0.072 dex, A(Fe) = 7.649 ± 0.033 (which would correspond to [Fe/H] 
= -1-0.2 for an absolute solar iron abundance of A(Fe) = 7.45, as inferred by numerous ID-LTE 
analyses), and vt = 1-03 ± 0.08kms“^. The solution with the Keck spectrum -|- MIKE/Magellan 
solar spectrum template gave Tefj = 5818 ± 21 K, log g = 4.33 ± 0.05 dex, [Fe/H] = 0.24 ±0.02, and 
micro turbulent velocities, vt = 1-07 ± 0.05 kms“^. 


As an additional search for unseen companions, we searched the HIRES spectrum for secondary 
spectral lines caused by a second star that is angularly close enough to the primary to fall into 
the HIRES spectrometer slit (Cf.'87 wide). The observed spectrum is first compared to a library 
of representative main sequence stars. The best match library spectrum is subtracted from the 
observed spectrum , and the residuals are then inspected for evidence of a secondary spectrum 
(jKolbl et al.ll2015l b We found no secondary spectrum brighter than 1% of the primary, separated 
in radial velocity by more than 10 km s“^. Stars separated by less than 10 km s“^ cannot be detected 
because they overlap with the primary spectrum. See section [9] for more details. 


Table [U contains the results for the spectroscopy of KIC 8311864 and attendant analyses. We 
considered the scatter and precision in the spectroscopic results in order to determine the most 
reasonable stellar parameters for the recovery of planetary parameters, as described in the next 
section. 


6. Host Star Properties 


Since KIC 8311864 is too faint for direct observations such as interferometry or asteroseismol- 
ogy, the derivation of stellar mass, radius and density relies on matching atmospheric properties to 
interior models. Figure [6] compares the spectroscopic solutions described in the previous section. 
The values are generally within 1 cr and no strong correlation of log g with Tgff or [Fe/H] is apparent, 
indicating that the differences are dominated by diffe rences in the methods rather than degenera¬ 
cies in atmospheric properties le.g.. iTorres et al.ll2012l l. On average, KIC 8311864 is slightly cooler, 
slightly larger, and about 60% more metal-rich than the Sun. 
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Table 1. Spectroscopy and Analysis Results for KIC 8311864 


Parameter 

Value 

Notes 

Tull/McDonald Observatory 



Effective temperature (K) 

5650 ± 108 

a 

Surface gravity log g (cgs) 

4.45 ± 0.16 

a 

Metallicity [Fe/H](dex) 

0.21 ± 0.07 

a 

Projected rotation Velocity i/sini (kms“^) 

4.3 ± 0.5 

a 

TRES/Whipple Observatory 



Effective temperature (K) 

5751 ± 55 

b 

Surface gravity log g (cgs) 

4.43 ± 0.10 

b 

Metallicity [Fe/H](dex) 

0.40 ± 0.08 

b 

Projected rotation Velocity i/sini (kms“^) 

4.2 ± 0.5 

b 

HIRES/Keck Observatory 



Effective temperature (K) 

5757 ± 60 

c 

Surface gravity log g (cgs) 

4.32 ± 0.07 

c 

Metallicity [Fe/H](dex) 

0.21 ± 0.04 

c 

Projected rotation Velocity z/sinz (kms“^) 

<1 

c 

Effective temperature (K) 

5740 ± 50 

b 

Surface gravity log g (cgs) 

4.42 ± 0.10 

b 

Metallicity [Fe/H](dex) 

0.22 ± 0.08 

b 

Projected rotation Velocity usini (kms“^) 

0.8 ± 0.5 

b 

Effective temperature (K) 

5818 ± 21 

d 

Surface gravity log g (cgs) 

4.33 ± 0.05 

d 

Metallicity [Fe/H](dex) 

0.24 ± 0.02 

d 


Note. — 

a: Analyzed with Kea. 
b: Analyzed with SPC. 
c: Analyzed with SpecMatch. 
d: Analyzed with MOOG. 
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To ensure self-consistency, we adopted a single spectroscopic solution for the remainder of the 
paper. We chose the SpecMatch solution, which yields an intermediate temperature, conservative 
surface gravity, and metallicity consistent with most other methods. To account for the systematic 
differences between methods, we added the standard deviation of all methods for a given param¬ 
eter in quadrature to the formal errors provided by SpecMatch. The final adopted atmospheric 
properties are listed in Table [2l 


To derive inter ior properties, we fitted Teg, log g, and [Fe/H] to a grid of Dartmouth isochrones 
(jPotter et al.N2008lh The grid was calculated using solar-scaled alpha-element abundances, and 
interpolated to a stepsize of 0.01 Mq in mass and 0.02 dex metallicity. We fitted the input values to 
the model grid using a Markov chain Monte Carlo (MCMC) algorithm which, at each step, draws 
samples in mass, age, and metallicity. For ease of computation, the samples in age and metallicity 
were drawn in discrete steps corresponding to the sampling of the model grid (0.5 Gyr in age and 
0.02 dex in [Fe/H]). For each sample of fixed age and metallicity, we linearly interpolated the grid 
in mass to derive and log g, which were then used to evaluate the likelihood functions. We 
adopted uniform priors in mass, age, and metallicity. We calculated 10® iterations and discarded 
the first 10% as burn-in. 


Figure [3 shows the marginalized and full posterior distributions, as well as the position of 
KIC 8311864 on a radius-temperature diagram. As expected, KIC 8311864 is slightly evolved off 
the zero-age main-sequence. The 68% confidence interval for the stellar radius ranges from 1.02- 
1.26 420, with a most probable value of 1.11 i^Q. 


As a consistency check, we also computed the age, mass, and radius of this star based on 
the MQOG a nalysis of the HIRES spectrum using Yons ei-Yale isochrones (IDemaraue et al.ll2004l : 


Yi et al.ll2004l l and the approach detailed in Section 4.5 of iRamfrez et al.l (j2014l h obtaining an age of 


5.2 (3.4-7.5) Gyr, a mass of 1.07 (1.02-1.12) Mq, and a radius of 1.12 (1.03-1.29) Rq. The values in 
parentheses represent approximate ±1 cr ranges of values. Note that the agreement in these derived 
quantities with those derived for the SpecMatch results is very high, giving us confidence that we 


Table 2. Stellar Properties of KIC 8311864 


Parameter 

Value 

Derived from high-resolution spectroscopy using SpecMatch 

Effective Temperature (K) 

5757 ± 85 

log [Surface gravity] (dex) 

4.32 ± 0.09 

Metallicity [Fe/H] (dex) 

0.21 ±0.09 

Derived from fitting log g and [Fe/H] to isochrones 

Radius (Rq) 

1 ^^+0.15 
-L-J--L_0.09 

Mass (Mq) 

1 nq7+0-054 

±.uo/ _o.047 

Mean Density (gcm~^) 

0.84l[];« 

Age (Gyr) 

6 zb 2 
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have a reasonable degree of control over the systematics in the different analysis techniques. 


7. Properties of the Planet Kepler-452b 


To estimate the posterior distribution on each htt ed paramete r in our transit model , we used an 
MCM C approach similar to the procedure outlined in IPord (2005) and implemented in lpowe et al.l 
(|2014l l. Our algorithm used both a Gibbs sampler and a control set of parameters to enable a 
ve ctor jumping di stribution to efficiently handle the sampling of correlated parameters as outlined 


m 


Gregorvl (|201lh . We directly htted the following transit and orbital parameters; orbital period 
(P), epoch, scaled planet radius (Pp/i?*), impact parameter (b), and eccentricity parameters 
(ecos (w) and esin (u;)). We generated ten sets of 1,000,000-element Markov chains, then discarded 
the hrst 20% of each chain to establish burn-in. The remaining sets were combined and used to 
calculate the mode, standard deviation, and 1 cr bounds of the distribution centered on the mode of 
each model parameter. Our model hts and uncertainties are reported in Table [3l The eccentricity 
of Kepler-452b was not well constrained by the light curve as was expected in the absence of a 
radial velocity detection of the planet. 


We used the Markov chains to derive model dependent measurements of the transit depth, 
Tdep, and transit duration, T^ur- The transit depth is dehned as the transit-model value when the 
projected distance between the star and planet is minimized. The transit duration is the time from 
first to last contact. We also convolved the transit model parameters with the stellar parameters 
(see Section [6]) to compute the planetary radius, Rp = 1.6 P®, and the flux received by the planet 
relative to the Earth {S = 1.1 S©). We also performed a fit forcing the orbit to be circular in our 
transit model and obtained thereby an estimate for the density of the star of p* = l.l±0.3gcm“^. 
While this value is perfectly consistent with the spectroscopic work, it suggests that both the star 
and the planet may be somewhat smaller than indicated by the SpecMatch analysis. 


8. Adaptive Optics Observations 


KIC 8311864 has one companion visible in the UKIRT image (see Figure [5]) at a separation of 
~4". This dim Kp = 18.435 star is well outside the 3 a confusion radius and thus cannot be the 
source of the transit-like features in the Kepler photometry, according to the centroid arguments 
presented in Section 14.31 However, it is important to probe the scene around KIG 8311864 for 
evidence of nearby companions that would not be identihable at the spatial resolutions in either 
the Kepler or the UKIRT image data. 


We obtained near-infrared adaptive optics (AO) images in the J (1.248 um ) and Br-y (2.157 


/rm) f ilters using the NIRG2 imager behind the natural guide star AO system (jWizinowich et al 


20041 : 1 Johannson et al.l l2008l ) at the Keck II telescope on UT 2014, June 13. Data were acquired 


in a three-point dither pattern to avoid the lower left corner of the NIRG2 array, which is noisier 
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Fig. 1.— Detrended flux time series for Kepler-452 versus barycentric-corrected Kepler Julian day 
(BKJD - 2,454,833.0). The triangle symbols indicate the locations of each of the four transits which 
occur in quarters 3, 7, 11 and 15. All 4 years of Kepler data are plotted here with vertical dashed red 
lines indicating the boundaries between consecutive quarters. The detrending was accomplished by 
subtracting the result of passing the systematic error-corrected light curve through a moving median 
filter with a window width of five times the ~10.5hr pulse duration. The light curve residuals are 
remarkably uniform across all 17 quarterly data segments, showing no evidence of electronic image 
artifacts which are responsible for a large number of long period false positive transit-like features 
in the Kepler light curves. 
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Table 3. Planet Parameters for Kepler-452b 


Parameter 

Value 

Notes 

Transit and orbital parameters 



Orbital period P (day) 

384.843lo°52 

a, b 

Epoch (BJD - 2454833) 

314.985l°yi| 

a, b 

Scaled planet radius Rp / R* 

0.0128l“;°0J3 

a, b 

Impact parameter b = a cos i/Ric 

0.69lgi« 

a, b 

Orbital inclination i (deg) 

89.806t°;«4 

a 

Transit depth T^ep (ppiri) 

19911® 

a 

Transit duration T^ur (i^r) 

1U.DO_q go 

a 

Eccentricity ecos (lj) 

o.03l°;^® 

a, b 

Eccentricity esin (lj) 

-o.02l°|l 

a, b 

Planetary parameters 



Radius Rp (R®) 

i.UO_o.20 

a 

Orbital semimajor axis a (AU) 

i.U4D_o.oi5 

a 

Equilibrium temperature Tequ (K) 

265111 

c 

Insolation relative to Earth 

1 iri-l-0-29 
-*^•^^-0.22 

d 


Note. — 

a: Based on the photometry, 
b: Directly fitted parameter. 

c: Assumes Bond albedo = 0.3 and complete redistribution, 
d: Based on Dartmouth isochrones. 
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Phase [Hours] 


Fig. 2.— Phase-folded, detrended flux time series for Kepler-452 versus phase in hours. The flux 
time series plotted here is the same as that of Figured] except that it has been phase folded at the 
period of the planet, 384.84 days. The triangle symbol indicates the location of the four transits at 
zero phase. The full resolution data are represented by black dots. The cyan filled circles represent 
the results of binning the data in phase with a bin width of 1/5 of the DV-fitted duration of the 
transit, 10.5 hr. 


than the other three quadrants. The dither pattern was performed three times with step sizes of 
3^.T, 2'.'5, and 2'.T for a total of nine frames; individual frames had an integration time of 60 s (30 s 
times 2 coadds) in both bands. The individual frames were flat-fielded, sky-subtracted (sky frames 
were constructed from a median average of the 9 individual source frames), and co-added. 


The average AO-corrected seeing (FWHM) was 0'.'04635 and 0'.'05229 at J- and K-bands, 
respectively. Figure [8] shows the Keck AO images obtained for this star. KIC 8311864 appears 
to be a well isolated star in both bands with no obvious background stars, or resolved, associated 
companions within ±2" that would present blends at the 4" pixel scale of Kepler's camera. FigurejH 
shows the 5 a sensitivity limits for rejecting the presence of background confusion sources as a 
function of radial distance from KIC 8311864. The sensitivity achieved in both bands establishes 
rejection of background stellar sources from ~(/.'5 to 2'.'0 within Amag ~ 9 mag o f the central 


targe t star, in the bands of observation. Using typical stellar Kep/er-infrared colors (jHowell et al 


2 OI 2 I I. the limitations of the observations translate into the Kepler bandpass sensitivity limits 
near Amag ~10-12mag. Given that the transits of Kepler-452b are 199 ppm deep, a background 
eclipsing binary exhibiting 50% eclipses would need to be within 8.66 mag of Kepler-452. The AO 
images and the difference image analysis carried out with the Kepler data allow us to validate the 
planetary nature of the transit source with high confidence, as detailed in the next section. 


9. BLENDER analysis 

In this section, we examine the possibility that the Kepler-452b signal may be due to an astro- 
physical false positive. Typical examples of such phenomena include a background or foreground 
eclipsing binary along the line of sight to the target (i.e., within the photometric aperture of Ke- 
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Fig. 3.— Phase-folded, detrended flux time series for Kepler-452 versus phase in hours as in Fig. [21 
with the data for the even transits separated from that of the odd transits. The full resolution data 
are represented by black dots. The cyan filled circles represent the results of binning the data in 
phase with a bin width of 1/5 of the DV-fitted duration of the transit, 10.5 hours. The red horizontal 
lines are the fitted depths for the odd transits (left) and the even transits (right) and the dashed 
horizontal lines represent the ±1 a error bars on the fitted depths. 


pier), a chance alignment with a background or foreground single star transited by a (larger) planet, 
or a physically associated star transited by a planetary or stellar companion (hierarchical triple). 
We refer to the first and second of these scenarios as “BEB” and “BP,” respectively, and to the 
third as “HTP” or “HTS,” depending on whether the object producing the eclipses is a planet or 
a star0 


Other possible false positive scenarios include the transit of a white dwarf, or the occultation 
of a brown dwarf in an ec centric orbit with an orientation such that the transit does not occur (see, 


e.g., 


Santerne et al.ll2013l b The white dwarf scenari o can be ruled out becau se it would result in a 
very large microlensing signal of ~2400ppm, as oer iSahu &: Gilliland! (j2003l b that is not observed. 
A non-transiting, occulting white dwarf is unlikely to produce a 200-ppm deep transit-like feature 


^^Note that in this paper we consider the BP and HTP scenarios as false positives, even though they involve a 
transiting planet. This is because, for a given transit signal, a planet orbiting an unseen star in the aperture could 
be considerably larger than if it orbited the target, and therefore less interesting for our purposes, as the likelihood 
of its being rocky would be much smaller. 
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Fig. 4.— Bootstrap false alarm rate (FAR) versus maximum multiple event statistic for the 
light curve for Kepler-452 (blue curve), and that expected for ideal Gaussian noise fitted to the 
empirical FAR between 1 x 10“^ and 1 x 10“^^ (dashed green curve). The Gaussian fit allows us 
to extrapolate the FAR out to the maximum MES provided by the planetary signature as the 
highest MES probed by the bootstrap distribution falls short of the 9.7 cr MES of this transiting 
planet in the Kepler data. The red star indicates the estimated bootstrap FAR obtained from the 
Gaussian fit. 


as the white dwarf would need to be about 7000 K to produce an occultat ion of the correct depth , 
and white dwarfs cool from 10,000 K to 6000 K in ~2 Gyr (see, e.g., Goldsburv et al. 2012). 
Thus, there would be only a very brief opportunity for a white dwarf to provide 200-ppm deep 
occultations of KIG 8311864 as the uncertainty in the transit depth is ^10 %. Further, we expec t 
white dwarf/main sequence pairs at a period of 385 days to be relatively rare: [Farmer &: Agoll (j2003l l 
predicted that only five microlensing white dwarf/main sequence binaries would be discovered in 
KepleEs FOV out to periods of ~100 days, and indeed, only one such white dwarf has been found 
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Fig. 5.— Difference image centroiding resul ts for Kepler-452b wit h a J-band infrared image from 
the UK Infrared Telescope Survey fUKIRT: Ihawrence et al.l 120071 1 overlaid. There is a faint, 18**^ 
mag star offset by to the west of KIC 8311864, but neither the difference image centroids in this 
figure, nor the centroid shift analysis provide any evidence suggesting that this star is the source 
of the transit-like features. The four green crosses indicate the difference image centroids for each 
of the four transits of this super Earth. The blue circle delimits the 3-cr confusion radius, which 
contains the target star, KIC 8311864, indicating that the target star is the most likely source of 
the transit features in the Kepler light curve, to the limits of the resolution of the UKIRT images. 


to date ( Kruse fc Agoll 2014) in an 88 day period orbit of a solar-like star exhibiting 
microlensing events and (coincidentally) ~1000ppm deep secondary occupations. 


TOGO ppm 


The brown dwarf scenario is very unlikely to begin with (much more so than the BEB, BP, or 
HTP/HTS scenarios) because of the known paucity of brown dwarfs as spectroscop ic companions to 
main -sequence stars within hundreds of AU, and especially G-type stars (see, e.g.. iMarcv &: Butler 
2000), whereas binaries with main-sequence companions are far more com mon. Only eight tran - 


siting brown dwarf/main sequence bin aries have been discovered to date (IMoutou et al 


four of which are i n the Kepler EOV (jBouchv et al 


2011 


Johnson et al 


2011 


Ipiaz 


et al. 


20131 


2010; 


Moutou et al.ll2013l ). All but one of these have orbital periods under 20 days, and KOI-415b has 
a period of 167 days (IMoutou et al.ll2013l ). Eurther, even the most massive brown dwarf would 
cool below a lumino sity ratio of 100 ppm with the Sun in 1.5 Gyr , and below 25 ppm by 6 Gyr 
(jBurrows et al.ll200lh . Even if such a case were to occur, the shape of the transit-like signals (par¬ 
ticularly the ingress/egress) would not match what we observe because of the much larger size of 
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Fig. 6.— Spectroscopically derived surface gravity as a function of effective temperature (top panel) 
and metallicity (bottom panel) for KIC 8311864, derived using different methods and spectra (see 
text for details). The position of the Sun is also shown. 
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Fig. 7.— Marginalized and full posterior distributions of Markov chain Monte Carlo chains based 
on using spectroscopic Tgff, log g and [Fe/H] to constrain a grid of Dartmouth isochrones. A random 
set of 750 samples (~ 0.1% of all iterations) is plotted for each parameter for clarity. Vertical dashed 
and dotted lines mark the posterior mode and 68% confidence interval around the mode for each 
parameter. Note that age and metallicity are discretely sampled with the resolution of the model 
grid. The top right panel shows the position of KIC 8311864 in a radius temperature diagram, with 
solid black lines showing evolutionary tracks with 0.9 - 1.2 Mq in steps of 0.05 Mq for [Fe/H] = 
0.21. The gray dashed line marks the zero-age main-sequence. 
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Fig. 8.— Images obtained with the NIRC instrument at the Keck II telescope on 2014 June 13 
in J band (left) and K band (right). Neither band indicates the presence of any background stars 
that might be the source of the transits observed in the Kepler photometry of Kepler-452. 


brown dwarfs compared to terrestrial planets like Kepler-NNNb. Thus we restrict our analysis to 
the above BEB, BP, and HTP/HTS conhgurations. 

The general procedure we followed consists of generating realistic transit-like light curves for 
a large number of these false positive scenarios (“blends”), and comparing each of them with the 
Kepler photometry for Kepler-452 to see which provide a fit indistinguishable from that of a model 
of a planet transiting the target star. This comparison allowed us to place constraints on the 
various physical parameters of the blends, including the sizes and masses of the bodies involved, 
their overall color and brightness, the linear distance between the background/foreground eclipsing 
pair and the target, etc. We then used a Monte Carlo approach to estimate the expected rate of 
occurrence of each type of blend (BEB, BP, HTP/HTS) from our simulations, and we compared 
the sum with the expected rate of occurrence of transiting planets similar to the one implied by 
the Kepler-452 signal. If the a priori planet occurrence is much larger than the anticipated total 
blend frequency, then we would consider the candidate to be statistically validated. 


The algorithm used to generate bl end light curves and compare them to the ob served photome¬ 


try in a sense is known as BLENDER ( Torres et al.ll2004l . l201ll : lFressin et al.ll2012l ). which has been 
applied in the past to the validation of many of the most interesting Keyler d iscoveries (see, e.g.. 


Ballard et al.ll2ni3l : lBarclav et al.ll2ni3l : lBorucki et al.ll2ni3l : lMeibom et al.ll2ni3l : ?), in cluding, most 


recently, a sample of twelve small planets in the habitable zone of their parent stars (iTorres et al 
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Fig. 9.— 5 a sensitivity limits on the rejection of background confusion sources in terms of Amag 
as a function of radial offset from KIC 8311864. Left: the dashed-dotted blue curve is the measured 
in-band sensitivities at 1.2/rm, while the solid green curve indicates the corresponding estimated 
sensitivity limits converted to the Kepler bandpass. Right: same sensitivity curves for 2.0 pm 


20151 ). For full details of the technique, we refer the reader to the latter publication. That work 
also explains the calculation of the blend frequencies through a Monte Carlo procedure, making 
use of the constraints provided by BLENDER along with other constraints available from follow-up 
observations. These include the sensitivity to companions at close angular separations from our 
AO imaging (Section [8]), limits on the brightness of unseen companions at even closer separations 
derived from our high-resolution spectroscopy fSection 15.3p . constraints on blended objects from 
our difference image analysis (specifically, the size of the angular 2>a exclusion region; Section Hj3]) . 
and also the brightness measurements for KIC 8311864 as listed in the KIC (which allowed us to 
rule out blends whose combined light is either too red or too blue compared to the observed color 
of the target). 

The time series used for the BLENDER analysis is the detrended Q1-Q17 LC systematic error- 
corrected light curve for KIC 8311864. The stellar parameters adopted for the target were those 
given in Section [6l The parameters of the simulated stars in the v arious false positive conhgurations 
were drawn from model isochrones from the Dartmouth series (iDotter et al.l 120081 ). and rely also 
on the k nown distributions of b i nary properties such as the orbital periods, mass ratios, and eccen¬ 
tricities (iRaghavan et al.l 120101 : iTokovininI l2014l ) . Other ingredients used for the blend frequency 
calculation included the number density of stars in the vicinity of the target from the Besangon 
Galactic structure model ( Robin et al. 2003, 2012 )1^ and the rates of occurrence of eclipsing bina- 


^^http://model.obs-besancon.fr . 
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ries (|Slawson et al.ll201lh and of transiting planets from the Kepler Mission itself (NASA Exoplanet 
Archive, consulted on 2014 March 16)0 

The BLENDER constraints for the BEB, BP, and HTP scenarios are illustrated in Eigures fTOh 
m Our simulations indicate that eclipsing binaries orbiting the target (HTS scenario) invariably 
produce light curves that have the wrong shape for a transit, or feature noticeable secondary 
eclipses that are not seen in the KIC 8311864 photometry. Similarly, blends involving background 
giant stars yield light curves that also have the wrong shape. We therefore restricted our analysis 
to main-sequence stars. Figure [TO] displays a sample cross-section of the blend parameter space for 
the BEB scenario, with the horizontal axis showing the mass of the primary star in the background 
eclipsing binary, and the vertical axis representing the relative distance between the background 
binary and the target. Blends with parameters interior to the white contour give a goodness of 
fit to the Kepler photometry that we considered indistinguishable from that of a transiting planet 
fit (to within 3a). We consider these as viable false positives. Also indicated in the figure are 
regions in which blends have an r — Kg color index that does not match that of the target (see 
figure caption), or where blends are not allowed because the intruding EB would have been detected 
spectroscopically. Similar diagrams for the BP and HTP configurations are shown in Figure [TTl and 
Figure EL Further explana tions regarding the BLENDER constraints may be found in the recent 
work by Torres et al. ( 2015 1. 


Based on the above analysis, and including also the information derived from the follow-up 
observations mentioned earlier, we arrived at estimates of the blend frequencies for the BEB, BP, 
and HTP scenarios of 1.21 x 10“^^, 2.56 x 10“^°, and 2.35 x 10“®, respectiv ely. These estimates 
make use of the frequencies of eclipsing binaries generated by the Kepler team (jSlawson et al.ll201ll ) 
as well as the frequencies of transiting planets of the specific size required to produce the above 
blends, based on the available sample of KOIs from the NASA Exoplanet Archive. The dominant 
contribution to the total blend frequency, as is often the case for Kepler candidates, was from false 
positives involving an unseen companion to the target star that is in a wide orbit and is transited 
by a larger planet (HTP). On the other hand, the expected rate of occurrence of transiting planets 
of the size of Kepler-452b, based again on the available sample of KOIs from the NASA Exoplanet 
Archive, is 9.97 x 10“^. The ‘odds ratio’ is therefore 9.97 x 10“'^/2.35 x 10“® « 424. Consistent with 
previous applications of BLENDER, we adopted a threshold for validation of 1/(1/99.73% — 1) Ri 370, 
corresponding to a 3 a significance. Any odds ratio above this has an even greater confidence level. 
This means that Kepler-452b is firmly validated at the 99.76% level. 


^^http://exoplanetarchive.ipac.caltech.edu . 
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10. Habitability and Composition of Kepler-452b 

Here we examine the probability that Kepler-452b orbits in the habitable zone of its star, 
consider the effects of the unknown eccentricity on the habitability, and estimate the likelihood 
that it is rocky. Of chief interest is whether the insolation flux experienced by Kepler-452b would 
permit water to exist in liquid form on its surface. Secondarily, we wished to determine the 
likelihood that it possesses a rocky surface upon which liquid water could pool. 


10.1. Habitability 


Following t he statistical approach t aken by [Torres et al.l (120151 1 , we took the insolation flux 
levels defined in iKopparapu et al.l (|2013l l for recent Venus and for early Mars to be the inner and 
outer boundaries of the wide or optimistic habitable zone, and the Insolation flux levels for the 
runaway greenhouse and a maximum greenhouse as the inner and outer boun daries for the narrow 
or conservative habitable zone. This was somewhat more conservative, as [Torres et al.l (j2015[ l 
took the inner bo undary of the habitable zone to correspond to that for dry desert worlds from 
Zsom et al.[ (j2ni,'ll ). which permits habitable worlds as close as 0.38 AU from a solar-like host star 
if the albedo is high and the relative humidity is low (~1%). 

The 900,000 MCMC chains for the stellar parameters discussed in Section [6| were convolved 
with realizations of the measured orbital period distribution to construct a population of insolation 
flux levels relative to Earth’s insolation, calculated as 


5 . 


= a 


-2 f ^eff 




TqJ 


R* 

Rq 


( 2 ) 


where ^0 = 5778 K is the solar temperature, Egfr is the effective temperature of the star, a is the 
mean separation between the star and the planet in AU, and 72* is the radius of the star. For each 
realized effective temperature value, the insolation flux values corresponding to the inner bound¬ 


aries of the optimistic and conservative habitable zones were calculated as per [Kopparapu et al 


([20131 1 and the sample was marked as habitable if the insolation was less than that for recent 
Venus/runaway greenhouse. Figure [T3| shows a false color image of the joint distribution between 
insolation and planetary radius for all 900,000 realizations, where the planetary radius samples were 
generated by combining the stellar radius samples with a realization for the transit depth, or ratio 
of the planetary radius to that of the parent star, Rp/R^,. Marginalizing the joint distribution over 
^eff, we find the likelihood that Kepler-452b falls within the wide or optimistic habitable zone is 
96.8%, while the likelihood is 28.0% that it falls within the narrow or conservative habitable zone. 


Given that the SpecMatch results are somewhat conservative with respect to the surface grav¬ 
ity, we repeated the calculations above for the SPC stellar parameters obtained for the HIRES 
spectrum as a means by which to explore the reasonable non-conservative limits. The SPC param¬ 
eters yielded realizations that are within the wide and the narrow habitable zone 99.0% and 59.3% 
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of the time, respectively. 


10.2. Effect of Eccentricity on Habitability 


The eccentricity of the orbit of Kepler-452b is poorly constrained by the photometric measure¬ 
ments, at best. With an orbital period of 384.84 days, and a mass of M 0 (see Section [10.3^ . 
Kepler-452b should exhibit a reflex velocity of ~45cms“^. As the host star is relatively faint 
for such work (Kp = 13.43), there is little chance of measuring its orbit through radial velocity 
observations in the near future. 


To investigate the effect of the uncertainties of the eccentricity of the orbit on the habitability 
of Kepler-452b, we examined the variation of the mean insolation flux as a function of eccentricity. 
Figure [TT] shows the mean, maximum and minimum insolation flux for realizations of an orbit at 
the semi major axis and orbital period of Kepler-452b as a function of the eccentricity from 0 to 
1. While the maximum insolation flux exceeds 4 5® for e > 0.475, the mean flux does not exceed 
the maximum permitted by the optimistic habita ble zone (rec e nt Ve nus) until e exceeds 0.8. The 
probability density for eccentricity developed by iRowe et al.l (j2014l ) based on the population of 
Kepler multiple transiting planets is also shown on t his hgure, il l ustrat ing the fact that transiting 
planets exhibit very low eccentricities in general. I Rowe et al.l (120141 1 modeled the eccentricity 
distribution as a beta distribution /3(x,y) with x = 0.4497 and y = 1.7938. The eccentricity 
distribution-weighted mean insolation flux for Kepler-452b is 1.17 5®, only 9% higher than that 
for e = 0. Thus we conclude that the uncertainties in the eccentricity of Kepler-452b do not 
significantly affect the habitability of its orbit. 


10.3. Composition 


Because Kepler-452b does not have a mass measurement, estimating the probability that it 
is rocky necessarily require s the application of st atistical mass-radius (M-R) relationships. One 
such relation is provided bv lWeiss &: Marcvi (|2014l l. w here the mass of the p lanet is a deterministic 
function of its radius. Alternatively, the analysis of I Wolfgang et al.l (j2015l l yields a probabilistic 
M~R relation that allows a range of masses to exist at a given radius, as we would expect if there 
is a distribution of compositions among the population. Either can be used to pro d uce ( Mp,Rp) 
pairs that can then be compared to the theoretical M-R curves of iFortnev et al.l (120071 ): If the 
radius is smaller at the assigned mass than that for a 100 % silicate planet, as determined by the 
100% rock mass fraction in the rock-iron composition curve (Equation ( 8 ) of that paper), then that 
( Alp, Rp) pair is c o nsiste nt with being rocky. We applied bot h the determinis t ic M- R relationship 
of I Weiss fc Marcvi (l2014l ) and the probabilistic formulation of I Wolf gang et al.l (j2015l ) to investigate 
the likelihood that Kepler-452b is rocky. 


Kepler-452b’s radius is actually a distribution which incorporates measurement uncertainty 
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reflected in the posterior distribution of the MCMC realizations. To calculate the probability that 
it is rocky, we drew a set of radii from this distribution, applied the M~R relations to each to obtain 
mass estimates, and measured the fraction of draws which were consistent with a rocky planet as 
defined above. Here the radius distribution was determined by both the posterior produced 

by the lightcurve fitting of Section [3 and the stellar radius posterior of Section [6] and Figure [3 
drawing randomly from each of these sets of posterior samples and multiplying them then gave the 
set of planet radius posterior samples used for this analysis. 


Because the IWeiss &: Marcvi (|2014l l M-R relation is a deterministic one, applying it to this 
problem is straightforward. However, it does not explicitly capture the intrinsic variability in 
planet masses that we both expect and observe, so the rocky probabilities that it produces may 
be inaccurate and biased. To improve on this, we compute d the posterior predict ive distribution 


for Kepler-452b’s mass given the probabilistic relationship of I Wolfgang et al 


(120151 1. This involved 


drawing posterior samples from the M-R relation hyperparameters (i.e. the power law constant, 
index, and the mass dispersion around this p ower law) that were p roduced by evaluating the hi¬ 
erarchical Bayesian model in this sample 


see 


Wolfgang et al.l 120151 . for more details). This draw 
dehned the M-R relation which then yielded the mass corresponding to the drawn radius. The 
resulting posterior predictive distribution therefore accounts for both the intrinsic, expected dis¬ 
persion in planet compositions in th is small-radius r a nge a nd for the uncertainties in the parameter 
values themselves. As a result, the I Wolf gang et al.l (1201511 approach most accurately incorporates 
the degree of our uncertainty in this radius-to-mass “conversion.” 


Specifically, we took the deterministic mass-density relation from Equations (l)-(3) in IWeiss fc Marcv 


(120141 1: 


Mp = min 


2.43-g 3.39 Rp 
5.513 


RpT 2m 


0.93 


(3) 


where Mp and Rp are the planetary mass and radius, respectively, in Earth units. 


The probabilistic mass-radius relation of I Wolfgang et al.l (j2015l l has the form: 

Mp ~ iV (^ = 2.7 R^-^] 0- = 1.9) , 


(4) 


where N (//; a) denotes a normal (Gaussian) distribution with mean, /x, and standard deviation, a, 
and where Rp and Mp are in Earth units. 


The radius of a pure rock composition planet was calculated as per Eq. 8 in iFortnev et al 


(|2007l l for the estimated planetary mass: 


Rrock = (0.0592 Aock + 0.0975) (log Mp)^ + 

(0.2337 /rock + 0.4938) log Mp+ (5) 

(0.3102/rock+ 0.7932), 


where /rock is the rock mass fraction (set to 1 for this analysis). 
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We computed the probabilities that Kepler-452b is rocky for both the c onservative SpecMatch 
stellar parameters and for the SPC results. Applying the M-R relation of lWeiss &: Marcvi (j2014l l 
gave rocky probabilities of 40% and 64%, respectively. The posterior predictive distr i butio n dis¬ 
played in hgure [15] obtained using the probabilistic M-R relation from IWolfgang et al.l (|2015l i gave 
49% and 62%, respectively. The good agreement between the results for both M-R relationships 
lends conhdence in the results and we estimate the likelihood that Kepler-452b is rocky as between 
that for the SpecMatch and the SPC parameters, namely, between 49% and 62%. 


We note that it is unlikely that Kepler-452b has an Earth-like composition: we computed the 


densit y of Earth-like composition planets for the SpecMatch realizations according to lFortnev et al 


(I 2 OO 7 I. i.e., a rock-iron mi xing ratio of 2/3 in Eg. 8 therein ) and found that only 16% and 22% of 


Weiss fc Marcvi (j201J) and of the lWolfgang et al.l (j2015l ) realizations, respectively, were at least 


the 

as dense as that predicted for Earth-composition planets. 


10.4. History of the Habitability of Kepler-452b 


Given the similarities between the Sun and KIC 8311864 with respect to mass and effective 
temperature, and the fact that the orbital distance of Kepler-452b is essentially 1 AU, it is interesting 
to consider the habitability of this planet over the history of this planetary system. It is important 
to note that statements comparing the age of KIC 8311864 to that of the Sun are tentative, given 
that the estimated age for KIC 8311864 is heavily model dependent and the 2Gyr uncertainty 
estimated in Section [6] does not take into account variations of poorly constrained model physics 
such as convection (mixing length parameter) and microscopic diffusion. In this section, we use 
our best estimate for the stellar age to consider a case study of the evolution of the habitable zone 
over the main-sequence lifetime for stars such as KIC 8311864 and our own Sun. 


We traced the evolution of the effect ive temperature an d the luminosity of KIC 8311864 using 

appropriate to its mass (1.037 Mq) 


the Dartmouth stellar evolution models ( Potter et al 


2008 


14 


and metallicity ([Fe/H] = 0.21). The evolving effective temperature allo wed us to determine the 
inner and outer radii of the conservative and optimistic habitable zone per lKopparapu et al.l (l2013l ) 
as a function of time. The luminosity history obtained from the Dartmouth models allowed us to 
estimate the radiation received by this super Earth over the main-sequence lifetime of its star. 

At ~6 Gyr, KIC 8311864 and its planet are about 1.5 Gyr older than the solar system, and the 
host star’s radius is ~10% larger than that of the Sun. This planet bathes in a stream of radiation 
in its orbit at a level only ~10% higher than that experienced by Earth today. However, since the 
mass of the host star is only ~4% greater than that of the Sun. Kepler -452b spent the first Gyr 
in the conservative habitable zone according to iKopparapu et al.l ()2ni3l ) , as illustrated in Figure [TBl 
This planet will remain in the optimistic habitable zone for another ~3.5 Gyr before the star leaves 


^^http: //stellar .dartmouth.edu / models / 
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the main-sequence and expands rapidly in the ascending red giant branch phase. 

If this exoplanet system is a future version of an analog to our own solar system, with additional 
planets in the orbits corresponding to those of Mars and Venus, their insolation history would be 
very similar to their namesakes in the solar system. An exo-Venus would have spent only the first 
~3 Gyr in the optimistic habitable zone before exiting it for good. In contrast, an exo-Mars would 
be located in the conservative habitable zone during the entire ~11 Gyr spent by the host star on 
the main-sequence. 

At present, Kepler-452b is the only known planet orbiting this star. Assuming that the dis¬ 
persion in orbital inclination angles is the same in the KIC 8311864 system as it is for the solar 
system (~1?9), Kepler would have had only a ~10% chance of observing a transiting exo-Venus 
in this system as well as Kepler-452b. The probability of observing a few transits of an exo-Mars 
in addition to those of Kepler-452b is ~6%, but it would have to have been a rather large planet 
in order for its transits to be readily identifiable in the light curve by eye since it would have pro¬ 
vided fewer than three transits over the course of the Kepler Mission. No such transits have been 
seen. Kepler had only a ~2% chance of seeing transits of both an exo-Venus and an exo-Mars in 
addition to those of the super Earth at 1.05 AU. Until such time as other planets are discovered 
by other exoplanet surveys, or a SETI observation of this planetary system reveals signatures of 
extraterrestrial technologies, we are left to speculate on the fate of an ancient civilization that 
may have developed first on Kepler-452b (or on a moon orbiting it). Eor example, it may have 
subsequently migrated to an as-yet-undetected outer planet to escape the inevitable loss of most of 
the intrinsic water inventory after the moist runaway greenhouse effect took hold at Kepler-452b’s 
orbital distance approximately 800 Myr ago. 


11. Conclusions 


The identification of a 1.6-i?® planet in the habitable zone of a G2 star at a mean separation 
of 1.05 AU represents the closest analog to the Earth-Sun system discovered in the Kepler data set 
to date with respect to the orbit and spectral type of the host star. Moreover, the small radius of 
this planet provides a reasonable chance between 49% and 62% that it is rocky, although in this 
case it is unlikely to have an iron core of significant mass. 


Given our current best estimates the host star is 10% larger and 1.5 Gyr older than the Sun; 
hence, Kepler-452b may represent the future environment that the Earth will experience as our 
own Sun evolves toward the red giant phase. Though it is currently just beyond the moist runaway 
gre enhouse level for the incid ent radiation, this planet is still within the optimistic habitable zone as 


KopparapTi et al.l (j2013l ). and will remain so for another ~3Gyr. However, Kepler-452b likely 


per 

spent the first 
Earth today. 


^5 Gyr in the conservative habitable zone, below the radiation level experienced by 


The paucity of small planets in or near the habitable zone of solar-like stars, particularly G 
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stars, makes the determination of the intrinsic frequency of Earth-size planets in the habitable zone 
of Sun-like stars difficult to determine robustly without resorting to extrapolation. Kepler-452b 
represents an important addition to the list of small planets in or near the habitable zone for its 
proximity to a G2 star. We are hopeful that this is only the first of many such planetary systems 
yet to be discovered in the fullness of time. 
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A. A Computationally Efficient Statistical Bootstrap Test for Transiting Planets 


To search for transit signatures, TPS employs a bank of wavelet-based matched filters t hat form 
a grid on a three-dimensio nal parameter space of transit duration, orbital period, and phase (j Jenkins 


2002 ; 


Jenkins et alJl2010l b A detection statistic is calculated for each template and compared to 
a threshold value of t/ = 7.1 a. Since TPS searches the light curve by folding the single-transit 
detection statistics at each trial orbital period, the detection statistic is referred to as a multiple 
event statistic, or MES. Detections in TPS are made under the assumption that the pre-whitening 
filter applied to the light curve yields a time series whose underlying noise process is stationary, 
white, Gaussian, and uncorrelated. When the pre-whitened noise deviates from these assumptions, 
the detection thresholds are invalid and the false alarm probability associated with such a detection 
may be significantly higher than that for a signal embedded in white Gaussian noise. The Statistical 
Bootstrap Test, or the Bootstrap, is a way of building the distribution of the null statistics from 
the data so that the false alarm probability can be calculated for each TGE based on the observed 
distribution of the out-of-transit statistics. 


Gonsider a TGE exhibiting p transits of a given duration within its light curve. In this analysis, 
the light curve is viewed as one realization of a stochastic process. Further, consider the collection 
of all p-transit detection statistics that could be generated if we had access to an infinite number of 
such realizations. To approximate this distribution we formulate the single event statistic time series 
for the light curve and exclude points in transit (plus some padding) for the given TGE. Bootstrap 
statistics are then generated by drawing at random from the single event statistics p times with 
replacement to formulate the p-transit statistic (i.e., the MES). Since the single event statistics 
encapsulate the effects of local correlations in the background noise process on the detectability of 
transits, so does each individual bootstrap statistic. So long as the orbital period is sufficiently long 
(generally longer than several hours), the single event statistics are uncorrelated and the MES can 
be considered to be formed by p independent random deviates from the distribution of null single 
event statistics. 


Jenkins et al.l (j2002l i formulated a bootstrap test for establishing the confidence level in plan¬ 


etary tr a nsit s ignatures identified in transit photometry in white, but possibly non-Gaussian noise. 


JenkinsI (120021 ) extended this approach to the case of non-white noise. In both cases, the boot¬ 


strap false alarm rate as a function of the MES of the detected transit signature was estimated 
by explicitly generating individual bootstrap statistics directly from the set of out-of-transit data. 
This direct bootstrap sampling approach can become extremely computationally intensive as the 
number of transits for a given TGE grows beyond 15. The number of individual bootstrap statistics 
that can be formed from the m out-of-transit cadences of a light curve and the p transits is m^, 
which is ~ 2.9 x 10'^® statistics for 10 transits and 4 years of Kepler data. An alternative, compu¬ 
tationally efficient method can be implemented by formulating the bootstrap distribution in terms 
of the probability density function (PDF) of the single event detection statistics. The distribution 
for the MES as a function of threshold can then be obtained from the distribution of single event 
statistics treated as a bivariate random process. 
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The MES, Z, can be expressed as 



(Al) 


where S is the set of transit times that a single period and epoch pair select out, C(i) is the 
correlation time series formed by correlating the whitened data to a whitened transit signal template 
with a transit centered at the ith timestep in the set 5, and N(i) is the template normalization 
time series. The square root of the normalization time series, yN(I), is the expected value of the 
MES or S/N for the reference transit pulseEl 


If the observation noise process underlying the light curve is well modeled as a possibly non¬ 
white, possibly non-stationary Gaussian noise process, then the single event statistics will be zero- 
mean, unit-variance Gaussian random deviates. The false alarm rate of the transit detector would 
then be described by the complementary distribution for a zero-mean, unit-variance Gaussian 
distribution: 

Fz(Z) = ierfc(z/V2) , (A2) 

where erfc(-) is the standard complementary error function. If the power spectral density of the 
noise process is not perfectly captured by the whitener in TPS, then the null statistics will not be 
zero-mean, unit-variance Gaussian deviates. A bootstrap analysis allows us to obtain a data-driven 
approximation of the actual distribution of the null statistics, rather than relying on the assumption 
that the pre-whitener is perfect. 


The random variable Z is a function of the random variables corresponding to the correlation 
and normalization terms in the single event statistic time series, Cp = X]jg 5 C(i), and Np = 
N(i). The joint density of Cp and Np can be determined from the joint density of the single 
event statistic components C and N as 


fCp,Np (Cp, Np) 


fc,N (C, N) * /c,N (C, N) * ... * /c,N (C, N), 


(A3) 


where is the convolution operator and the convolution is performed p times. This follows from 
the fact that the bootstrap samples are constructed from independent draws from the set of null 
(single event) statistics with replacement^ Given that convolution in the time/spatial domain 
corresponds to multiplication in the Fourier domain, Eq. IA3I can be represented in the Fourier 
domain as 


‘^Cp.Np = • ‘hc,N • ■ ■ • • ^C,N = 


(A4) 


^®The inverse of \/N(i) can be interpreted as the effective white Gaussian noise “seen” by the reference transit and 
is the definition for the combined differential photometric precision (CDPP) reported for the Kepler light curves at 
3, 6, and 12 hour durations. 

^®In this implementation, we choose to assume that the null statistics are governed by a single distribution. If this 
is not the case (for example, if the null statistic densities vary from quarter to quarter) then Ea. lA3l could be modified 
to account for the disparity in the relevant single event statistics in a straightforward manner. 
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where $c,N = ^ {/c,n} is the Fourier transform of the joint density function /c,n- Here, the 
arguments of the Fourier transforms of the density functions have been suppressed for clarity. The 
use of 2D fast Fourier transforms results in a highly tractable algorithm from a computational point 
of view. 


The implementation of Eq. IA4I requires that a 2D histogram be constructed for the {C,N} 
pairs over the set of null statistics. Care must be taken to manage the size of the histogram to 
avoid spatial aliasing as the use of fast Fourier transforms corresponds to circular convolution. We 
chose to formulate the 2D grid to allow for as high as p = 8 transits. The intervals covered by 
the realizations (i.e., the support) for each of C and N were sampled with 256 bins, and centered 
in a 4096 by 4096 array. When p > 8, it is necessary to implement Eq. IA4I iteratively in stages, 
after each of which the characteristic function is transformed back into the spatial domain, then 
bin-averaged by a factor of 2 and padded back out to the original array size. Care must also be 
taken to manage knowledge of the zero-point of the histogram in light of the circular convolution, 
as it shifts by 1/2 sample with each convolution operation in each dimension. 


Once the p-transit 2D density function fcp,Np (Cp, Np) is obtained, it can be “collapsed” into 
the sought-after one-dimensional density fz{Z) by mapping the sample density for each cell with 
center coordinates {Cj,Nj} to the corresponding coordinate Zjj = and formulating a 

histogram with a resolution of, say, 0.1 <7 in Z by summing the resulting densities that map into 
the same bins in Z. Due to the use of EFTs, the precision of the resulting density function is limited 
to the floating point precision of the variables and computations, which is ~ 2.2 X 10 For small 
p, the density may not reach to the limiting numerical precision because of small number statistics, 
and for large p, roundoff errors can accumulate below about 10“^^. The results can be extrapolated 
to high MES values by htting the mean, p, and standard deviation, a, of a Gaussian distribution 
to the empirical distribution in the region 10“^ < Fz < 10“^^ using the standard complementary 
error function: Fz (Z) = 0.5erfc ((Z — p) 1^/2 a). 


In order to model the use of x-square vetoes (ISeader et al.ll2013l ). we pre-hltered the single 
event statistic time series to remove the three most positive peaks and their “shoulders” down to 2 
a. The three most negative peaks were also handled in a similar fashion to avoid biasing the mean 
of the null statistics in a negative direction. We also identified and removed points with a density 
of zero-crossings that fell below 1/4 that of the median zero-crossing density. This step removed 
single event statistics in regions where the correlation term experienced strong excursions from 
zero due to unmitigated sudden pixel sensitivity dropouts and thermal transients near monthly 
and quarterly boundaries. Typically, more than 99% of the original out-of-transit single event 
statistics were retaine d by these pre-filters for the 16,285 TCEs resulting from the Q1-Q16 planet 
search documented in iTenenbaum et al.l (j2ni4l ). 
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Fig. 10.— Map of the surface (goodness of fit) for KIC 8311864 corresponding to blends 
involving background eclipsing binaries. On the vertical axis we represent the linear distance 
between the BEB and the target (E*beb — Dta.rg)-, cast for convenience in terms of the difference in 
distance modulus, A5 = 5 log (UBEB/Marg)- Only blends within the solid white contour (darker 
colors) provide fits to the Kepler light curve that are within acceptable limits (3(7, where a is the 


signi ficance level of the difference compared to a transiting planet model ht; see iFressin et al 


20121 1. Other concentric colored areas (lighter colors) represent fits that are increasingly worse (4cj, 


5 (7, etc.), which we considered to be ruled out. The blue cross-hatched areas correspond to regions 
of parameter space where the blends are either too red (left) or too blue (right) compared to the 
measured r — Kg color of the target, by more than three times the measurement uncertainty. The 
dashed green line labeled AKp = 8.4 is tangent to the white contour from above, and corresponds 
to the faintest viable blends. The green line labeled AKp = 5.0 represents the spectroscopic limit 
on faint background stars. All simulated blends below this line (green hatched region) are brighter 
and are generally excluded if the BEB is angularly close enough to the target to fall within the slit 
of the spectrograph. 













Distance modulus difference 
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Fig. 11.— Similar to Figure fTOl (and with the same color scheme) for blends involving background 
or foreground stars transited by a planet (BP scenario). The faintest blends giving acceptable fits 
have AKp = 8.0 relative to the target (dashed green line). Blends below the solid green (AKp = 
5.0, hatched green area) are generally excluded by the spectroscopic constraint. 
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Fig. 12.— Similar to Figure fTOl for the case of physical companions to KIC 8311864 that are 
transited by a planet (HTP). Only companion stars with masses larger than about 0.96 Mq or 
smaller than about 0.56 Mq yield blend colors that are consistent with the measured r — Kg index 
of KIC 8311864. However, some of these blends are eliminated by the spectroscopic constraint, as 
they are brighter than AKp = 5.0 (green hatched area). The only false positives that would remain 
undetected (viable) have companion stars smaller than 0.50 Mq and are transited by planets with 
sizes that place them within the white contours. 
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Fig. 13.— The joint density of the planetary radius, Rp, of Kepler-452b and its insolation flux, 565, 
in Earth insolation units, displayed as a false color image. The vertical lines show the insolation flux 
values for the modes of t he MCMC realiza t ions f or the SpecMatch stellar parameters presented in 
Section[6]as developed bv lKoDparapu et al.l (j2013l h the optimistic or wide habitable zone is bounded 
on the inside by recent Venus (solid line) and on the outside by early Mars (dashed dotted line), 
and the conservative or narrow habitable zone is bounded on the inside by the runaway greenhouse 
effect (dashed line) and on the outside by the maximum greenhouse effect (dotted line). 
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Eccentricity 


Fig. 14.— The insolation flux, ^eff, in Earth insolation units received by the planet Kepler-452b 
as a function of orbital eccentric ity, along with a prior distribution for eccentricity for transiting 
planets as per iRowe et al.l (j2014l l . The mean, max, and min insolation flux values are indicated by 
the solid blue, the dashed green and the da shed-dotted red cuiwes, respectively. The optimistic and 
conservative habitable zones, according to iKopparapu et al.l (j2013l l. are indicated by the light and 
dark green shaded regions, respectively. The eccentricity prior is indicated by the solid magenta 
curve with the square symbols. 
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Fig. 15.— The joint density of the planetary radius, Rp, of Kepler-452b and its mass, Mp, in Earth 
units, displayed as a scatter plot with marginal histograms for each quantity for the SpecMatch 
derived para meters. The planetary mass realizations were generated using a hierarchical Bayesian 
model as oer lWolfgang et al.l (|2015l j which explicitly models the planetary mass as a distribution 
for a given planetary radius. The data and histograms ar e colored red for thos e realizations that are 
denser than predictions for 100% silicate planets as per iFortnev et al.l (j2007l b and black for those 
realizations that are less dense and likely to be non-rocky. The fraction of realizations allows us to 
estimate the probability to be 49% that Kepler-452b is rocky for this set of stellar and planetary 
parameters. 
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Fig. 16.— A history of the habitable zone of KIC 8311864 as delineated by the amount of flux 
received by a planet at the orbital distance of Kepler-452b (1.05 AU - essentially the same as that 
of the Earth) over the main-sequence lifetime of its star. This figure plots the radiation received 
by Kepler-452b from 0.25 Gy r to 11 Gyr as a thick b lue curve. The optimistic and conservative 
habitable zones, according to iKopparapu et al.l ([2013), are indicated by the light and dark green 
shaded regions, respectively. The planet is currently experiencing radiation at a level 10% above 
that received by Earth. However, over the first 5.2 Gyr of its history, Kepler-452b was in the 
conservative habitable zone, receiving less than 1.0369 of the flux received by contemporary Earth. 
The red and purple stippled curves represent the flux received by planets in Mars’ and Venus’ orbits 
(1.52 and 0.72 AU), respectively. 























